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ABSTRACT 

In spite of the promise of new techniques for constraining the mass distribution in 
galaxy clusters, much remains to be learned from galaxy orbital velocities. This 
article reviews the theory of potential estimation in hot dynamical systems like 
galaxy clusters. An analysis is presented of the Coma cluster, based on a sample of 
~1500 galaxies with probable membership, of which ~450 have measured velocities. 
The Coma data are shown to be consistent with a model in which the dark matter 
density falls roughly as inside of ~1.5 Mpc (ffo = 50), with, perhaps, a transition 
to a slower falloff at larger radii. We find no significant evidence for a core in either 
the galaxy number densities or the mass distribution. 
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1. Introduction 

Since Zwicky's^ realization that the gravitational mass of the Coma cluster greatly exceeds 
the mass in stars, there has been no compelling reason to assume that the spatial distribution 
of the dark matter bears any relation to the density profile defined by the galaxies. But until 
recently, techniques for determining the gravitational potential $(r) have failed to yield strong, 
model-independent constraints on the dark matter distribution. Happily, this situation is about 
to change. New data from X-ray observatories (as described by K. Yamashita, H. Bohringer and 
R. Mushotzky at this meeting) are providing the first detailed information about the dependence 
of the gas temperature on position in galaxy clusters, information which will eventually translate 
into strong constraints on the potential. And new techniques - such as mass-mapping using 
gravitational lenses - are beginning to provide estimates of the matter distribution that are free of 
the assumptions of dynamical equilibrium and spherical symmetry (G. Soucail, J. Miralda-Escude 
and I. Small, this meeting). 

At the same time, the more classical approach based on measurements of discrete galaxy 
velocities bears re-examining due to the large data sets that are now becoming available. Here we 
review the theory of kinematical mass estimation in "hot" dynamical systems like galaxy clusters. 
We argue that much of the past work in this field has suff"ered from a confusion between "model 
fitting" and "statistical estimation." While it is easy to construct a model that is consistent, 
in some weak sense, with the kinematical data, such models are often highly nonunique. Most 
authors have based their mass estimation algorithms on the velocity dispersion profile defined by 
the galaxies, thus guaranteeing that their solution for $(r) will be degenerate. Furthermore, the 
estimation of quantities like p{r), the mass density profile, is strongly "ill-conditioned" in the sense 
understood by statisticians,^ meaning that the use of ad hoc parametrized models is guaranteed 
to severely bias the answer regardless of the quality of the data. While many of the published 
algorithms are very sophisticated, and a few are reasonably nonparametric, we show that most of 
these accomplish nothing more than an evaluation of the virial theorem, and hence tell us little 
that is unique or compelling about the distribution of dark matter. 

Even in a precisely spherical and relaxed cluster, reasonably model-independent estimates 
of $(r) and p{r) require large data sets as well as numerical algorithms that can use the complete 
information contained within a discrete set of positions and velocities. Below we analyze the Coma 
cluster and conclude that a fairly wide range of models is consistent with the currently existing 
kinematical data. In models where the galaxy velocities are close to isotropic near the cluster 
center, the inferred dark-matter density falls roughly as in the inner parts of the cluster, with 
no strong evidence for a core. At larger radii, r ^ 1.5 Mpc [Hq = 50 km s ~^ Mpc~^), we find 
marginal evidence for a slower falloff" of p with r, implying a total mass for the Coma cluster that 
could be several times larger than the standard value based on a naive application of the virial 
theorem. Larger kinematical samples may eventually allow us to choose between these models. 

Recent work (as reviewed by M. West at this meeing) has demonstrated the prevalence of 
substructure and departures from dynamical equilibrium in many galaxy clusters. Here we ignore 
such complications, even though they might seriously bias estimates of the mass distribution as 
inferred from kinematical data. Our aim is to demonstrate the difficulty of the potential estimation 
problem even in the idealized spherical a starting point for more sophisticated studies. 

2. Potential Estimation in Hot Systems 

Dynamical masses of stellar or galactic systems are usually estimated from the line-of-sight 
velocities of some set of luminous tracers orbiting in the overall potential. Rough estimates of the 
mass are often based on the virial theorem, which for a spherical system is 
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with (w^) the mean square velocity of some sample orbiting in the potential $(r). In a spherical 
system, (w^) is equal to three times the mean square line-of-sight velocity, independent of any 
assumptions about the velocity anisotropy. In nonspherical systems (which certainly include galaxy 
clusters), the virial theorem contains a geometrical factor that depends on intrinsic shape and 
orientation, both of which are typically unknown. More serious, however, is the unknown radial 
dependence of Without some information about the relative distribution of dark and luminous 
components, the virial theorem places only order-of-magnitude constraints on the total mass, even 
in the spherical case, and says virtually nothing about the central density or scale length of the 
matter that determines the potential. Most of the cluster "virial masses" quoted in the literature 
were derived from a form of the virial theorem which assumes that the mass is attached to, or has 
the same overall spatial distribution as, the galaxies. If mass does not follow light, these estimates 
are worth very little. 

One might hope to do better by constructing the velocity dispersion profile of the galaxies, 
since this function contains information about the variation with radius of the kinematical quanti- 
ties. But the extra information helps surprisingly little. Idealizing a galaxy cluster as a nonrotating 
spherical system, the Jeans equation states 



Eq. (2) contains the two velocity dispersions ar{r) and (T((r), measured along and tangential to the 
radius vector. The observed velocity dispersion at every projected radius is a complicated average 
along the line of sight of these two intrinsic components, and contains too little information to 
determine both functions independently. As a result, many different $(r)'s can be made equally 
consistent with an observed velocity dispersion profile by varying the assumed dependence of 
anisotropy on radius. This problem is widely recognized, but it is rarely emphasized just how 
great the indeterminacy is if no a priori constraints are placed on the galaxy orbits or on the 
distribution of the mass. Given perfect measurements of the surface density and velocity dispersion 
profiles of a set of galaxies in a spherical cluster, the central mass density is uncertain by several 
orders of magnitude, even if one imposes the reasonable constraint that p{r) be a declining function 
of radius.^ The total mass is uncertain by a smaller, but still large, factor - roughly an order of 
magnitude in the case of measured profiles like those in the Coma cluster. The shapes of the galaxy 
orbits in the more extreme models are not very likely to occur in nature, but the dependence of 
quantities like the mass density on the assumed kinematics is so strong that even mild departures 
from isotropy can imply huge changes in the mass density. 

However the amount of information in a large sample of line-of-sight velocities is much 
greater than that contained within the velocity dispersion profile alone. For example, if one knew 
the maximum line-of-sight velocity at every projected radius R, one would have a secure limit on 
depth of the potential at every intrinsic radius r = R, namely $(r = R) < —v'^^^{R)/2, independent 
of any assumptions about orbital shapes. In galaxy clusters, this approach is not very useful, since 
it requires measurement of the poorly-defined wings of the velocity distribution and even then only 
imposes a lower limit on the depth of the potential. 

We can do even better by making use of the complete distribution of line-of-sight velocities. 
In a spherical cluster, define the "projected distribution function" N{R, V) such that N{R, V)dV 
is the surface density at R of galaxies with radial velocities in the range V to V + dV. At a 
given R = Ro, the function N{Ro,V) is the so-called "line profile," the distribution of line-of-sight 
velocities at that projected radius. The integral of N over V is the surface density profile of the 
kinematic sample; the first moment of N over V is proportional to the line-of-sight rotational 
velocity profile (typically negligible for galaxy clusters); the second moment gives the line-of-sight 
velocity dispersion profile; etc. Although — remarkably, given the importance of the question 
— no one has yet proven mathematically that N{R, V) contains enough information to uniquely 
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determine the gravitational potential even in a spherical system, there is good reason to believe 
that it constrains the potential very tightly, and perhaps uniquely.^"® For instance, the line profiles 
N{Ro, V) have distinctly different shapes in systems dominated by eccentric or circular orbits (Fig. 
1). This fact suggests that we can determine the velocity anisotropy directly from the shapes of 
the line profiles, and then use the Jeans equation (1), with the known anisotropy, to infer the mass 
distribution. 



Core Halo 
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Fig. 1. Line-of-sight velocity distributions in a spherical cluster containing galaxies on three kinds of orbits. 
(1) Nearly circular orbits; (2) isotropic velocities; (3) strongly eccentric orbits. The left and right panels 
represent the appearance of the line profiles near the apparent center and in the halo, respectively. 

Since the velocity dispersion profile itself tells us little about $(r), it follows that almost all 
of the kinematically- derivable information about the potential is contained within these finer details 
of the line-of-sight velocity distribution - crudely speaking, in the deviations of the line profiles 
from Gaussians (although the line profiles in even a precisely isotropic cluster need not be exactly 
Gaussian). This is a discouraging result, since modest departures from spherical symmetry or 
equilibrium in a galaxy cluster might substantially affect the shapes of these curves. In addition, 
rather large data sets - containing, perhaps, several hundred or even thousand velocities - are 
needed to construct reliable estimates of N{R, V), even in a spherical cluster. 

A nagging question is how to interpret past work on mass distributions in galaxy clusters 
as inferred from galaxy velocities. Almost all of these studies were based on the velocity dispersion 
profile alone - the additional (and essential) information contained within N{R, V) was not used 
(perhaps in part because of the small size of most cluster data sets). We might expect these 
studies to have reached no very definite conclusions about the form of $(r), at least in cases where 
the velocity anisotropy was left as a free function to be determined by the data. On the contrary, 
however, many of these papers contain definite statements about the preferred form of the potential 
and of the galaxy kinematics - statements which appear to be justified, since the authors typically 
show that a particular model represents the data best. For instance, dynamical studies of the Coma 
cluster ''"^^ often conclude that the best-fit model is one in which mass approximately follows light, 
and the galaxy velocities are roughly isotropic. How should we interpret these statements? Is there 
some objective sense in which these "best-fit" models are more likely than other models? 

All of the Coma cluster studies cited above were "parametric" : a set of convenient math- 
ematical functions were postulated for representing the dynamical quantities of interest - e.g. 
the components of the galaxy velocity dispersion tensor, or the phase-space density of the 
galaxies, etc. - and the parameters of the assumed functions were then varied to maximize 
the goodness-of-fit of the spatially projected models to the number density and velocity dispersion 
profiles. 



One danger of parametric techniques is that there almost always exists a single choice of 
parameters for which the model fits the data best, even if the underlying problem is mathematically 
degenerate - that is, even if the data, assumed complete and error-free, are insufficient to constrain 
the solution uniquely. This is because a single member of the parametric family will usually lie 
closest, in function space, to the region containing the set of possible, exact solutions. This single 
function will be selected by the optimization routine as the one that best matches the data, even 
though a more flexible representation of the unknown function would have yielded a range of 
equally-good solutions. In such cases, the "best-flt" model has no physical signiflcance whatsoever; 
it is purely an artifact of the particular choice of parametric representation, since a difl"erent 
representation would have yielded a difl"erent "best-flt" model. 

A simple experiment demonstrates the relevance of these arguments to the potential es- 
timation problem. Suppose that we specify the observed number density and velocity dispersion 
proflles, S(-R) and o-p{R), of a spherical cluster. Suppose we represent the unknown distribution 
function describing the galaxies as a sum of basis functions, e.g. 



where n is the number of terms retained in the expansion. By allowing / to depend on the orbital 
angular momentum L we permit the velocity distribution to be anisotropic. For any assumed 
potential $(r), we can then vary the Cij to optimize the flt of the projected / to the measured 
proflles. Repeating this experiment with a family of trial potentials, we can flnd the pair of 
functions {$, /} that best reproduces T,{R) and (Tp{R). 

Fig. 2 shows the result for a $(r) deflned by two parameters, core radius and total 
mass M. The plotted contours represent the mean square deviation of the best-flt model from the 
"observed" proflles, in the assumed potential. When n, the number of basis functions representing 
/, is small, these contours single out a particular set of values Rc,M as most likely. (The lowest- 
order term in Eq. 3 gives the exact / from which the "observed" proflles were generated; thus, the 
peak in Fig. 2a lies very close to the true potential.) As n is increased, the contours become 
peculiarly elongated; when n = 15, there is no longer a single best-flt potential, but instead a 
curved region in {Rc,M) space along which the goodness of flt is nearly constant. Thus, when the 
representation of the unknown / is strongly parametrized, the algorithm gives what appears to be 
a unique solution for the potential; while a more flexible representation for / reveals that a large 
number of forms for the potential are equally likely. Clearly, the "best-flt" potential in the flrst 
frame is simply an artifact; a diff"erent choice of basis set in the expansion (3) would have singled 
out a diff"erent point on the (Re, M)-plane as optimum. 

Most of the published studies of the mass distribution in galaxy clusters were based on 
parametrized representations with n = 2 or 3. As Fig. 2a shows, it is not surprising that these 
studies were able to flnd "optimum" values for the parameters deflning the potential. Dejonghe^^ 
adopted the same series representation as in Eq. (3) for /, and used n = 9 terms in the expansion. 
His plots of goodness-of-flt for the Coma cluster (his flgures 1 and 2) look very much like Fig. 
2b here: Dejonghe also found a narrow ridge in potential parameter space along which was 
nearly constant, with no well-deflned maximum. Ironically, Dejonghe's treatment of the Coma 
data, which was algorithmically much superior to the others cited above, placed the weakest limits 
on the form of the potential, since his algorithm was the most flexible and hence best able to 
represent the wide range of /'s corresponding to diff"erent potentials. 

The open curve in Fig. 2c is the relation between M and R^ deflned by the virial theorem, 
Eq. (1). Since the quantity (v^) is determined uniquely by the adopted proflles T,{R) and o-p{R), the 
virial theorem implies a relation between the two parameters R^ and M that deflne the adopted 
potential. It is apparent that the ridge of nearly- const ant x^ in Fig- 2c is simply following this virial 
theorem curve. In other words, almost any potential that is consistent with the virial theorem can 
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reproduce the kinematical data equally well. Thus, one way to interpret the results of past studies 
of the mass distribution in galaxy clusters is to say that these authors - with the help of sometimes 
formidable numerical machinery - did nothing more than to evaluate the virial theorem. 



N = 3 N = 6 
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Fig. 2. Four attempts to find the "best-fit" potential from radial velocity data. In panels (a)-(c), the 
goodness of fit is defined as the mean square deviation of the theoretical S(-R) and (Jp(R) from the data; TV" 
is the number of basis functions used to approximate /. Panel (d) shows contours of constant likelihood, as 
defined in the text; the + marks the correct solution. The solid line in (c) is the curve defined by the virial 
theorem. 

[The fact that the "best-fit" models for the Coma cluster are often characterized by a 
nearly constant mass-to-light ratio and by a nearly isotropic velocity distribution has a simple 
explanation. It so happens that the Coma data are well described by such a model, as first shown 
by Rood et alJ Many authors - perhaps subconciously - adopt parametrized families for functions 
like f{E, L^) or ar{r) that contain as a special case an isotropic distribution function with a number 
density profile close to that of the Coma galaxies. Their optimization routine then returns this 
model, or one similar to it, as the best fit.] 

It is possible to modify the algorithm just described to make use of the complete information 
contained within a discrete set of positions and velocities. One simply replaces the functional 
describing the goodness of fit - taken above to be the mean-square deviation of the model profiles 
from the observed profiles T,{R) and (Tp{R) - by the likelihood that the particular set of iZ's and y's 



would have been observed if the model were correct. That is, one varies / and $ to optimize 
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where N{Ri, Vi) is the value of the projected distribution function corresponding to the model {/, $} 
at the data point Ri,Vi. This modification is moderately difficult from a technical point of view, 
since it requires the computation of the line profiles for every considered / and But Fig. 2d 
shows that the extra eff"ort is justified: the "most likely" potential, here computed from a sample of 
300 positions and velocities, is now well-defined and very close to the correct one. (The regions of 
zero probability in Fig. 2d correspond to potentials for which at least one of the measured velocities 
Vi exceeds the escape velocity at r = Ri.) By evaluating goodness of fit via the likelihood rather 
than the algorithm is forced to take account of the full distribution of line- of- sight velocities, 
and not just the dispersions, when judging the adequacy of a model. 

Although encouraging, this experiment is still parametric in its representation of $(r). 
There might easily exist some very diff"erent $(r), not contained within the family of functions 
considered, that is equally consistent with the data. In fact numerical experiments show that 
- while data sets of a few hundred velocities can place usefully tight constraints on a potential 
characterized by only two free parameters, as in the example presented above - such data can 
not be used to make very model-independent statements about $(r).^^ The reason is that many 
more than a few hundred velocities are needed to accurately determine the line-of-sight velocity 
distributions N{R,V). Furthermore, features in the line profiles that one might be tempted to 
attribute to anisotropy may be due in a real cluster to departures from equilibrium or spherical 
symmetry. Perhaps the most we can hope to accomplish in galaxy clusters is to falsify some 
interesting, simple model - e.g. a spherical model in which mass follows light, or in which the 
velocity distribution is everywhere isotropic, etc. - by comparing the detailed distribution of 
velocities in the model with that in the observed cluster. 

3. The Coma Cluster 

A. Biviano (this meeting) describes an ongoing project to measure a large number of galaxy 
velocities in the Coma cluster. One motivation for this study is to look for evidence in the kine- 
matics for substructure or departures from equilibrium. Here we present an analysis of the existing 
velocity data under the assumption that Coma is spherical and relaxed. We discuss the consistency 
of this assumption at the end. 

One simple way to compute $(r) from the kinematical data, without assuming ad hoc forms 
for the unknown functions, is to suppose that the distribution of galaxy velocities is everywhere 
isotropic. The intrinsic velocity dispersion is then given by the deprojection of the observed velocity 
dispersion profile: 



with j^(r) the spatial density of the galaxies, obtained by deprojecting the surface density: 
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The mass within r then follows from Eq, 



(2): 
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and the mass density is 



p{r) 



1 dM 
iirr^ dr 
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Here we are treating the galaxies as if they were ions in an X-ray emitting gas; the quantity (T^(r) 
plays the role of fcT(r)/m in the gas. 

Eqs. (5) - (8) define an "inverse problem" with a unique solution p{r), given smooth 
estimates of T,{R) and (Tp{R). Astronomers tend to solve such problems by postulating a model, 
then projecting it into observable space and comparing with the data. But statisticians are fond 
of noting that the use of parametrized models for the solution of inverse problems is extremely 
dangerous, even if (unlike the case discussed in the previous section) the inverse problem has a 
mathematically unique solution. The reason is that the quantities of interest are usually related 
to the data via diff"erentiations. For instance, p{r) depends on a second derivative of j^(r) (Eqs. 7 
and 8), and is itself a deprojection, i.e. derivative, of S (Eq. 6). Any small error in the choice of 
parametrized model to represent T,{R) or (Tp{R) will be amplified enormously in the computation of 
p{r). At one level, this means simply that accurate estimation of quantities like the mass density 
requires high-quality data. But regardless of the quality of the data, one has a much better chance 
of finding the correct solution if the modeling is carried out nonparametrically. This is because a 
nonparametric function estimate will tend to follow the curvature implied by the data, rather than 
imposing a shape that is likely to be subtly wrong, even if it fits the data well in a sense. 

A uniformly consistent way of solving ill-conditioned inverse problems like this one^^ is 
to construct smooth estimates of the input functions (here S and ap) directly from the data, 
using a nonparametric algorithm, then to operate mathematically on these smooth functions to 
produce estimates of the functions of interest (i.e. p). Confidence bands on the estimates can be 
constructed via the bootstrap.^"* The instability due to the inversion is dealt with by simply using 
a larger smoothing length on the data when constructing the estimates of $ or p, than would be 
appropriate for the estimates of the data functions S or ap themselves. By contrast, parametric 
techniques deal with the instability by brute force, and are almost certain to bias the answer 
regardless of the quality or quantity of the data. 
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Fig. 3. Nonparametric estimates of the number density profile j)(r) and line-of-sight velocity dispersion profile 
iTp(i2) defined by the Coma galaxies. The points in the right-hand panel are the 433 measured velocities 
from the compilation of T. Bird & M. King. Dashed lines are 90% confidence bands on the estimates. 



Accordingly, Fig. 3 shows nonparametric estimates of j^(r) and (Tp{R) for the Coma galaxies. 
The estimate j>(r) was computed from a sample of 1480 galaxies identified as likely members by 
Mellier et al.,^^ using a "maximum penalized likelihood" algorithm^'' with Abell's^® choice of cluster 
center. Interestingly, j>(r) does not look very similar to any of the functional forms that are usually 



fit to it, such as the lowered isothermal sphere. Instead there is a roughly power-law cusp inside 
of 10', with j> (X r~^. The estimate d-p{R) was computed from a set of 433 galaxies with measured 
velocities, as compiled by T. Bird and M. King. The "LOWESS" regression algorithm of W. S. 
Cleveland^^ was used. Both estimates are highly uncertain at radii iZ £ 3' due to the small number 
of bright galaxies near the center. 

Fig. 4a gives the estimated mass density as a function of radius in the isotropic model, 
obtained by applying eqs. (7) and (8) to i) and a-p. p{r) falls roughly as over most of the cluster 
in this model, although there is a hint of a core inside of a few hundred kpc; however the 90% 
confidence bands are consistent with a pure power law even at small radii. 

Although consistent - by construction - with the data as presented in Fig. 3, this isotropic 
model might still be inconsistent with the full set of velocities in Coma. Fig. 5 shows estimates of 
the line profiles N{R, V) at three radii in the Coma cluster, as computed with an adaptive kernel 
algorithm^° using galaxies from the Bird-King sample grouped in three radial annuli. Shown for 
comparison are the line profiles predicted by the isotropic model just discussed; the latter were 
computed from 

N{R,V)=^ / [«'V2 + y V2 + (9) 

Jii-2 Vr^ - Jo 

with the isotropic distribution function f{E) computed from /> and $ via Eddington's equation. (For 
clarity, we have omitted confidence bands on the model N{R,vys.) While the overall agreement 
is reasonable, there are some apparently signficant diff"erences. The velocity distribution near the 
center of Coma is more peaked near V = than in the model, perhaps indicative of an anisotropic 
subpopulation at the cluster center. At intermediate radii, the Coma velocity distribution is 
somewhat bimodal; at large radii, the distribution is once again nearly symmetric, but the mean 
has shifted by about 150 km s~^ from the mean velocity near the center. Furthermore, the model 
profile is somewhat less peaked than the true profile at large radii. 

-i-n-n 1 I 1 ij n-n-q 1 I 1 
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Fig. 4. Nonparametric estimates of the mass density profile p(r) in the Coma cluster, (a) Isotropic model; 
(b) anisotropic model (ra = 60' 2.4 Mpc). Dashed lines are 90% confidence bands on the estimates; Hq = 
50 km s"^ Mpc 

We can test the sensitivity of the predicted N{R, V) to the assumption of isotropy by varying 
the assumed kinematics - taking care to leave fixed the detailed dependence of S and ap on radius. 
In other words, we vary both $(r), as well as the anisotropy /3(r) = 1 — o-f /af, in such a way as to 
leave S and ap unchanged. One natural (though admittedly parametric) choice for describing the 



internal kinematics is o-f/a^ = 1 + r^/rl, where the anisotropy radius ra defines where the galaxy 
velocities begin to become strongly radial. For any choice of ra, one can then derive $(r) using a 
set of equations similar to those given above for the isotropic case.^^ There then exists a simple 
(though not unique) anisotropic distribution function f{E + L^/2rl) that yields the observed j>(r) 
in this potential. N{R,V) can be computed from this / via a Monte-Carlo algorithm. 

The results are shown in Figs. 4 and 5, for ra = 60' ^ 2.4 Mpc. The predicted mass 
density profile is still well described as p cx r~^ near the cluster center, but becomes much fiatter 
at large radii to compensate for the assumed anisotropy. (Values of ra less than about 40' yield 
a non-monotonic, and eventually negative, mass density profile; thus the model shown here with 
ra=60' is close to the maximally anisotropic allowed by this form of The predicted line-of-sight 
velocity distributions (Fig. 5) diff"er from those in the isotropic model only at large radii, where 
the preponderance of radial orbits predicts a more peaked profile, as in fact seen in the Coma 
data. However the predicted diff"erence between the two models, even at large radii, is not much 
greater than the uncertainty in the estimates of the Coma line profiles themselves, suggesting that 
the current sample of ~450 velocities is barely large enough to distinguish between these two very 
diff"erent models. This result is not surprising: since both the area and the width of the curves 
in each frame of Fig. 5 are fixed by construction, the only way these curves can diff"er is in their 
detailed shapes, and large numbers of discrete velocities are required to detect such deviations with 
certainty.^^ 
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Fig. 5. Observed and theoretical line pro- 
files for the Coma cluster. Heavy solid lines 
are nonparametric estimates of the distribu- 
tion of line-of-sight velocities in the Bird-King 
sample, with galaxies grouped into three ra- 
dial bins. Dashed lines are 90% confidence 
bands on the estimates. Thin solid lines are 
the line profiles from the two spherical models 
described in the text, with ra = oo (isotropic) 
and ra = 60'. The two models produce nearly 
identical line profiles for R ^ 40'; at large radii, 
the anisotropic model produces a more peaked 
profile. 



Because p{r) rises more steeply into the center of Coma than the galaxy number density, 
the local mass-to-light ratio increases sharply at small radii in both of the models presented here, 
by a factor of ~ 3-5 between 1 Mpc and 300 kpc. 



We note that the Coma line profiles are reasonably symmetric, with no strong indications 
of substructure. Larger velocity samples, such as that of Biviano et al, should tell us whether 
the slight asymmetries seen in Fig. 5 are indicative of substructure or are simply finite-sample 
fiuctuations. (The circular orbital time exceeds Hq^ at r ^ 100' ^ 4 Mpc in our isotropic model, so 
we would not expect Coma to be completely relaxed at the largest radii for which we have data. But 
even at smaller radii, the X-ray emission^"* and galaxy positions^^ show evidence for subgroupings 
and ongoing formation.) The presence of dynamically- significant substructure is potentially the 
most serious problem facing cluster mass determinations based on galaxy velocities. 

4. Conclusions 

Even in a perfectly spherical and stationary cluster, estimation of the radial dependence 
of the dark matter density from line-of-sight velocity data is a hard problem. One always has the 
freedom to "adjust" the assumed dependence of velocity anisotropy on radius to compensate for 
changes in the assumed mass distribution, in such a way that the galaxy velocity dispersion profile 
remains precisely unchanged. But most past studies have begun by reducing all of the kinematical 
data to a velocity dispersion profile; the "optimum" models found in these studies can accordingly 
be shown to be numerical artifacts, resulting from the use of ad hoc parametrized functions to 
describe the galaxy velocity distribution function or its moments. Almost all of the information 
about the radial dependence of the potential is contained within the fine details of the line-of-sight 
velocity distribution N{R, V). Furthermore, very diff"erent dynamical models - constructed so as to 
reproduce exactly the number density and velocity dispersion profiles defined by the galaxies - can 
yield very similar N{R,vys. Distinguishing between these diff"erent models, even in the idealized 
spherical case, requires large samples of discrete velocities, N ^ 1000. If the cluster is nonspherical 
or out of equilibrium, even larger velocity samples (and more sophisticated algorithms) would be 
needed to constrain $(r) in a model-independent way. We have shown that the existing velocity 
data for the Coma cluster are reasonably consistent with an equilibrium model in which the mass 
density falls off" as ~ for r £ 1.5 Mpc, with, perhaps, a more gradual falloff at larger radii; 
the total mass of the cluster is poorly defined by these data but could be several times the value 
derived by assuming that mass follows light. 
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